CDS

Accession Number TCMCG075C26410
gbkey CDS
Protein Id XP_017983642.1
Location complement(join(7413839..7414192,7414362..7414474,7414713..7414788,7414902..7415051,7415577..7415654,7415759..7415839,7415933..7415998,7416081..7416146,7416256..7416306,7416467..7416550,7417110..7417210,7417302..7417443,7418348..7418445,7418590..7418659,7418753..7418851))
Gene LOC18588828
GeneID 18588828
Organism Theobroma cacao

Protein

Length 542aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018128153.1
Definition PREDICTED: putative clathrin assembly protein At5g35200 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category TU
Description Clathrin assembly protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko04131        [VIEW IN KEGG]
KEGG_ko ko:K20043        [VIEW IN KEGG]
ko:K20044        [VIEW IN KEGG]
EC -
KEGG_Pathway -
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005886        [VIEW IN EMBL-EBI]
GO:0016020        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0071944        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGTCTGGCGGGGGTACTCAGAAGAGCTTGAGGAAAGCACTTGGAGCCATCAAGGATACCACCACTGTTTCATTGGCTAAAGTCAATAGTGATTATAAGGAATTGGATATTGCTATAGTTAAGGCCACAAATCATTATGAACGTCGTGCAAAGGAAAAACATATAAGAGCTATTTTTGCTGCCATTTCAGCTACTAGGCCTCGGGCTGATGTTGCCTATTGCATCCAGGCTCTTGCAAGGCGGCTATCAAGGACTCATAATTGGGCGGTTGCATTGAAAACTTTAATAGTCATTCATCGTGCTCTGAGGGAGGTGGACCCTACATTTCATGAAGAAGTCATTAACTATGGCAGAAGTAGAAGCCATATGCTTAACATGTCTCATTTCAAGGACGATTCCAGCCCAAATGCATGGGATTATTCTGCCTGGGTTCGCACTTATGCCTTATTCTTGGAGGAGAGGCTGGAATGTTTTCGTGTCTTGAAGTATGATATTGAGATGGACCGCCCAAGAACGAAAGATTTGGACACTGCAGAGTTGCTTGAGCAGTTGCCAGCTTTGCAACAGCTCCTTTTCCGCGTTCTTGGCTGTCAGCCACAAGGTGCAGCTGTTCATAATTTTGTAATCCAGTTGGCGCTTTCAATGGTGGCTACTGAAAGTGTCAAAGTTTATCAAGCTATAAGTGATGGTACAGTCAATCTCGTAGACAAGTTCTTTGAGATGCAACGTCCAGATGCTATAAAGGCTTTGGATATATACAGGAGATCGGGGCAACAGGCCGAGAGACTTTCAGAATTTTACGAAGTATGTAAAAGTCTTGATGTTGGGCGTGGAGAAAGGTTTATTAAGATAGAGCAGCCACCTGCATCATTTCTACAAGCCATGGAAGAGTATGTAAGAGAAGCTCCACGGGCTTCAACAGTTCGCAAAGATCAGGTTGACAAACCCAAGGAGGTGTTGGCCATTGAGTACAAGAAAACCCTGGAGGTGCAGGAGGAATGTAAACGTTCACCATCACCTCCTCCTCCTGAACCAGAAAAAGTGGAGAAAGTGGAAGAGCCTATTGTTGAACCACCTGATTTGTTGGGTTTAAATAATTCTGTCCCAGTTGCTTCAGAATTAGATGAGAAGAATGCCTTGGCATTGGCTATTGTTCCTGCTGAGCAAATGACTTCTGCTGCTGCTCCTGTTCAAACCAATGGTACTACTGGCTGGGAATTGGCACTTGTCACTGCTCCAAGTTCAAATGACAGTGCTACTGCTGCTAGCAAACTGGCGGGAGGACTTGACAAGCTTACATTAGACAGCCTGTATGATGATGCAATCAGAAGAAGCAACCAGAGCGTGACCTATAATCCCTGGGAACCAGCTCCTATGTCTGGTGCCATGATGCAACAACCAGCGCATGACCCCTTTTATGCTTCCAACATGGTTCCTGCTCCACCTTCAGTCCAGATGGCAGCAATGGCCAATCAGCAGCAGGCTTTTATGTTGCAGCAGCAGGTGATGATGATGGGCCCGCAACAGCAGGCTTCAAATCCTTTTGGCAATCCTTATGGAGCCAGTGTCCACCCTTACGGCTCAGGTATGCCAGTTCAAGCACACAATCCATATACAGGCCTTCTATAG
Protein:  
MSGGGTQKSLRKALGAIKDTTTVSLAKVNSDYKELDIAIVKATNHYERRAKEKHIRAIFAAISATRPRADVAYCIQALARRLSRTHNWAVALKTLIVIHRALREVDPTFHEEVINYGRSRSHMLNMSHFKDDSSPNAWDYSAWVRTYALFLEERLECFRVLKYDIEMDRPRTKDLDTAELLEQLPALQQLLFRVLGCQPQGAAVHNFVIQLALSMVATESVKVYQAISDGTVNLVDKFFEMQRPDAIKALDIYRRSGQQAERLSEFYEVCKSLDVGRGERFIKIEQPPASFLQAMEEYVREAPRASTVRKDQVDKPKEVLAIEYKKTLEVQEECKRSPSPPPPEPEKVEKVEEPIVEPPDLLGLNNSVPVASELDEKNALALAIVPAEQMTSAAAPVQTNGTTGWELALVTAPSSNDSATAASKLAGGLDKLTLDSLYDDAIRRSNQSVTYNPWEPAPMSGAMMQQPAHDPFYASNMVPAPPSVQMAAMANQQQAFMLQQQVMMMGPQQQASNPFGNPYGASVHPYGSGMPVQAHNPYTGLL